Medical Acronym Disambiguation Using Online Sources

نویسندگان

  • Janet Rajan
  • Karen C. Davis
  • Pawel Matykiewicz
  • Wlodzislaw Duch
  • John Pestian
چکیده

Hospitals produce millions of patient records consisting of clinical annotations containing extensive usage of abbreviations. The data in these clinical annotations are an excellent source for bioinformatics research but the use of abbreviations can create ambiguity. The main objective of our research is to develop a software application that takes a medical acronym as input and accesses medical and pharmaceutical websites to retrieve information from articles containing the acronym along with a user selected full-form of the acronym. The retrieved information consists of article title, authors, publication date, article abstract, and Medical Subject Header (MeSH) details. The information is used by researchers in the Biomedical Informatics Division of the Cincinnati Children’s Hospital Medical Center as part of a research effort for reducing the ambiguity created by the use of acronyms. Our contribution to this research effort is a framework for disambiguation that accesses online sources; we design and populate an internal database that can be used in future research efforts.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automated Identification of Synonyms in Biomedical Acronym Sense Inventories

Acronyms are increasingly prevalent in biomedical text, and the task of acronym disambiguation is fundamentally important for biomedical natural language processing systems. Several groups have generated sense inventories of acronym long form expansions from the biomedical literature. Long form sense inventories, however, may contain conceptually redundant expansions that negatively affect thei...

متن کامل

Acronym Disambiguation Using Word Embedding

According to the website AcronymFinder.com which is one of the world's largest and most comprehensive dictionaries of acronyms, an average of 37 new human-edited acronym definitions are added every day. There are 379,918 acronyms with 4,766,899 definitions on that site up to now, and each acronym has 12.5 definitions on average. It is a very important research topic to identify what exactly an ...

متن کامل

Kernel Methods for Word Sense Disambiguation and Acronym Expansion

The scarcity of manually labeled data for supervised machine learning methods presents a significant limitation on their ability to acquire knowledge. The use of kernels in Support Vector Machines (SVMs) provides an excellent mechanism to introduce prior knowledge into the SVM learners, such as by using unlabeled text or existing ontologies as additional knowledge sources. Our aim is to develop...

متن کامل

A Comparative Study of Supervised Learning as Applied to Acronym Expansion in Clinical Reports

Electronic medical records (EMR) constitute a valuable resource of patient specific information and are increasingly used for clinical practice and research. Acronyms present a challenge to retrieving information from the EMR because many acronyms are ambiguous with respect to their full form. In this paper we perform a comparative study of supervised acronym disambiguation in a corpus of clini...

متن کامل

Semi-Supervised Maximum Entropy Based Approach to Acronym and Abbreviation Normalization in Medical Texts

Text normalization is an important aspect of successful information retrieval from medical documents such as clinical notes, radiology reports and discharge summaries. In the medical domain, a significant part of the general problem of text normalization is abbreviation and acronym disambiguation. Numerous abbreviations are used routinely throughout such texts and knowing their meaning is criti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007